Microdata Protection Method Through Microaggregation: A Median-Based Approach
نویسندگان
چکیده
Microaggregation for Statistical Disclosure Control (SDC) is a family of methods to protect microdata from individual identification. SDC seeks to protect microdata in such a way that can be published and mined without providing any private information that can be linked to specific individuals. The aim of SDC is to modify the original microdata in such a way that the modified data and the original data are similar. Microaggregation works by partitioning the microdata into groups, also called clusters of at least k records and then replacing the records in each group with the centroid of the group. In this work we introduce a new microaggregation method, where the centroid is considered as median. The new method guarantees that the microaggregated data and the original data are similar by using statistical test. Another contribution of this work is that we propose a distance metric, called absolute deviation from median (ADM) to evaluate the amount of mutual information among records in microdata. We showed that ADM is always less than the most commonly used measure of distortion called sum of squares of errors (SSE) for any dataset. Thus ADM causes least information loss and can be used as a measure of information loss for a microaggregated microdata set.
منابع مشابه
Microdata Protection Method Through Microaggregation: A Systematic Approach
Microdata protection in statistical databases has recently become a major societal concern and has been intensively studied in recent years. Statistical Disclosure Control (SDC) is often applied to statistical databases before they are released for public use. Microaggregation for SDC is a family of methods to protect microdata from individual identification. SDC seeks to protect microdata in s...
متن کاملMicrodata Protection Through Approximate Microaggregation
Microdata protection is a hot topic in the field of Statistical Disclosure Control, which has gained special interest after the disclosure of 658000 queries by the America Online (AOL) search engine in August 2006. Many algorithms, methods and properties have been proposed to deal with microdata disclosure. One of the emerging concepts in microdata protection is kanonymity, introduced by Samara...
متن کاملAn approximate microaggregation approach for microdata protection
Microdata protection is a hot topic in the field of Statistical Disclosure Control, which has gained special interest after the disclosure of 658000 queries by the America Online (AOL) search engine in August 2006. Many algorithms, methods and properties have been proposed to deal with microdata disclosure. One of the emerging concepts in microdata protection is k-anonymity, introduced by Samar...
متن کاملA Comparative Study on Microaggregation Techniques for Microdata Protection
Microaggregation is an efficient Statistical Disclosure Control (SDC) perturbative technique for microdata protection. It is a unified approach and naturally satisfies k-Anonymity without generalization or suppression of data. Various microaggregation techniques: fixed-size and data-oriented for univariate and multivariate data exists in the literature. These methods have been evaluated using t...
متن کاملFuzzy Microaggregation for Microdata Protection
In this work we describe a microdata protection method based on the use of fuzzy clustering and, more specifically, using fuzzy c-means. Microaggregation is a well-known masking method for microdata protection used by National Statistical Offices. Given a set of objects described in terms of a set of variables, this method consists on building a partition of the objects and then replace the ori...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Information Security Journal: A Global Perspective
دوره 20 شماره
صفحات -
تاریخ انتشار 2011